Similar resources
Blog Track Open Task: Spam Blog Classification
Spam blogs or Splogs are blogs with either auto-generated or plagiarized content created for the sole purpose of hosting ads, promoting affiliate sites and getting new pages indexed. Splogs now rival generic web spam and e-mail spam, presenting a major problem to analytics on the blogosphere from basic search and indexing, to opinion, community, influence and correlation detection. This open ta...
BlogVox: Separating Blog Wheat from Blog Chaff
Blog posts are often informally written, poorly structured, rife with spelling and grammatical errors, and feature non-traditional content. These characteristics make them difficult to process with standard language analysis tools. Performing linguistic analysis on blogs is plagued by two additional problems: (i) the presence of spam blogs and spam comments and (ii) extraneous non-content inclu...
Logic Blog
1. Jan 2010: a downward GL1 set that is not weakly jump traceable
1.1. Original result
1.2. New results
1.3. Comments
2. March 2010: Structures that are computable almost surely
2.1. A structure that is computable almost surely, is computable in every Π1 random
2.2. Related questions
3. May 2010: Cooper’s jump inversion and weak jump traceability
4. May 2010: The collection of ...
Finding patterns in blog shapes and blog evolution
Can we cluster blogs into types by considering their typical posting and linking behavior? How do blogs evolve over time? In this work we answer these questions, by providing several sets of blog and post features that can help distinguish between blogs. The first two sets of features focus on the topology of the cascades that the blogs are involved in, and the last set of features focuses on t...
HIT_LTRC at TREC 2010 Blog Track: Faceted Blog Distillation
This paper describes our participation in the faceted blog distillation task at Blog Track 2010. In our approach, the Indri toolkit is applied for basic topic relevance retrieval. Then the Maximum Entropy (ME) model is adopted to judge the relevance of each blog to the specified facet. Feed faceted relevance is calculated by integrating the average relevance of all blogs within a feed and the average r...
Journal
Journal title: Index on Censorship
Year: 2008
ISSN: 0306-4220,1746-6067
DOI: 10.1080/03064220701882822